Representing and Querying Standoff XML
Abstract
The paper discusses the representation and exploitation of multi-level annotated linguistic data. We first present a standoff XML representation, which distributes information over separate layers and lets us represent annotations of various kinds in a uniform, generic way. This format serves as our interchange format. We further introduce an inline XML representation designed for more efficient processing of the data. This format is computed from the standoff representation and uses fragments to represent overlapping elements. We then compare the two representations by testing their performance on a test suite. Not surprisingly, the inline variant performs much better than the standoff variant, in particular on more complex queries.
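To make the idea of computing an inline format with fragments more concrete, here is a minimal Python sketch. It is not the paper's actual format or algorithm: the tuple layout, the `doc` wrapper element, the `phrase` and `entity` tags, and the `ref` attribute are all hypothetical, chosen only for illustration. The sketch assumes standoff annotations are given as character offsets into a base text and splits any element that would cross another element's boundary into fragments sharing the same `ref` value.

```python
from xml.sax.saxutils import escape
import xml.etree.ElementTree as ET


def inline_with_fragments(text, annotations):
    """Merge standoff annotations into one well-formed inline XML string.

    annotations: iterable of (ann_id, tag, start, end) tuples with half-open
    character offsets into `text` and start < end (hypothetical layout).
    Ids and tags are assumed to be plain XML names needing no escaping.
    """
    # Longer spans first at the same start, so outer elements open first.
    anns = sorted(annotations, key=lambda a: (a[2], -a[3]))
    points = sorted({0, len(text)} | {p for a in anns for p in (a[2], a[3])})
    out, stack, pos = ["<doc>"], [], 0

    def open_tag(a):
        # Every fragment of one annotation carries the same ref value.
        out.append('<%s ref="%s">' % (a[1], a[0]))
        stack.append(a)

    for p in points:
        out.append(escape(text[pos:p]))
        pos = p
        # Close everything that ends here; elements that sit above an ending
        # element on the stack are closed too and reopened as new fragments.
        reopen = []
        while any(a[3] == p for a in stack):
            a = stack.pop()
            out.append("</%s>" % a[1])
            if a[3] != p:
                reopen.append(a)
        for a in reversed(reopen):
            open_tag(a)
        # Open annotations that start at this offset.
        for a in anns:
            if a[2] == p:
                open_tag(a)
    out.append("</doc>")
    return "".join(out)


# Two overlapping annotations from two hypothetical standoff layers.
layers = [("s1", "phrase", 0, 6), ("e1", "entity", 4, 9)]
xml = inline_with_fragments("abcdefghij", layers)
print(xml)
# <doc><phrase ref="s1">abcd<entity ref="e1">ef</entity></phrase><entity ref="e1">ghi</entity>j</doc>

# The result parses as XML, and the fragments of e1 can be rejoined by ref.
root = ET.fromstring(xml)
parts = [e for e in root.iter("entity") if e.get("ref") == "e1"]
print("".join("".join(e.itertext()) for e in parts))  # efghi
```

In this sketch the overlapping `entity` annotation ends up as two elements, which keeps the document well-formed but means a query has to rejoin fragments by `ref` whenever it needs the whole span.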
Similar resources
Efficient XQuery Support for Stand-Off Annotation
XML annotations are a widely occurring phenomenon in many application fields, and XML databases should be used to store and query such data. To provide intuitive and fast querying of annotations, we make a case for extending XPath with four new axis steps that correspond to so-called StandOff joins, introduced here. The new steps can be efficiently implemented using a region index and fast lo...
Querying Embedded RDF Data with XML Technology: A Feasibility Study
XML has become the de facto standard for representing and accessing data on the Web. At the same time RDF is becoming more and more popular for representing metadata. While RDF also has an XML-based syntax, storage and query technologies for the two formats are not compatible due to differences in the data model. This is a potential problem when trying to query data that combine XML data with R...
Validity-Sensitive Querying of XML Databases
We consider the problem of querying XML documents which are not valid with respect to given DTDs. We propose a framework for measuring the invalidity of XML documents and compactly representing minimal repairing scenarios. Furthermore, we present a validity-sensitive method of querying XML documents, which extracts more information from invalid XML documents than does the standard query evaluat...
Representing and Querying the Evolution of Databases and their Schemas in XML
We show that XML views combined with XML query languages can provide surprisingly effective solutions to the problem of representing and querying the evolution of databases—both the evolution of their contents and the evolution of their schemas. Indeed, using XML, the histories of database relations can be represented naturally by means of temporally grouped data models. We show that schema cha...
Representing and Querying Multi-dimensional Markup for Question Answering
This paper describes our approach to representing and querying multi-dimensional, possibly overlapping text annotations, as used in our question answering (QA) system. We use a system extending XQuery, the W3C-standard XML query language, with new axes that allow one to jump easily between different annotations of the same data. The new axes are formulated in terms of (partial) overlap and cont...
Publication date: 2007